Influential Gene Selection From High-Dimensional Genomic Data Using a Bio-Inspired Algorithm Wrapped Broad Learning System
نویسندگان
چکیده
The classification of high dimensional gene expression/ microarray data always plays an important role in various disease diagnoses and drug discovery. To avoid the curse dimensionality, selection most influential genes remains a challenging task for researchers machine learning field. As extraction features by bio-inspired algorithm is non-deterministic polynomial-time (NP-Hard) task, possibility applying new there. In this suggested work, recently developed algorithm, Monarch Butterfly Optimization (MBO), wrapped with Broad Learning System (BLS), called MBO-BLS, to choose classify simultaneously. first stage, pre-selection method (Relief) used select feature subset. Then, selected subset undergoes further execution MBO-BLS model. estimate efficacy presented model, six cancerous datasets are taken. Here, sensitivity, specificity, precision, F-score, Kappa, MCC measures impartial comparison. Further, prove supremacy method, basic BLS, Genetic Algorithm BLS (GA-BLS), Particle Swarm (PSO-BLS), existing ten models taken Moreover, examine designed model statistically, Analysis variance (ANOVA) test also performed here. From above qualitative quantitative analysis, it concluded that proposed outclasses other considering models.
منابع مشابه
Statistical Learning Methods for High Dimensional Genomic Data Statistical Learning Methods for High Dimensional Genomic Data Title: Statistical Learning Methods for High Dimensional Genomic Data
Due to their high-dimensionality, -omics technologies require the development of computational methods that are able to work with large number of variables. Each data type is characterized by its method of measurement and by the biological aspect under study. Understanding the data properties allows the design of sophisticated and effective computational models that are able to uncover and expl...
متن کاملBio-inspired Broad-class Phonetic Labelling
Recent studies have shown that the correct labeling of phonetic classes may help current Automatic Speech Recognition (ASR) when combined with classical parsing automata based on Hidden Markov Models (HMM). Through the present paper a method for Phonetic Class Labeling (PCL) based on bio-inspired speech processing is described. The methodology is based in the automatic detection of formants and...
متن کاملFeature Selection for Small Sample Sets with High Dimensional Data Using Heuristic Hybrid Approach
Feature selection can significantly be decisive when analyzing high dimensional data, especially with a small number of samples. Feature extraction methods do not have decent performance in these conditions. With small sample sets and high dimensional data, exploring a large search space and learning from insufficient samples becomes extremely hard. As a result, neural networks and clustering a...
متن کاملDe-Identification of Health Data in Big Data using a Novel Bio-Inspired Apoptosis Algorithm
part of this journal may be reproduced or used in any form or by any means without written permission from the publisher, except for noncommercial, educational use including classroom teaching purposes. Product or company names used in this journal are for identification purposes only. Inclusion of the names of the products or companies does not indicate a claim of ownership by IGI Global of th...
متن کاملGenomic Selection GENOMIC SELECTION USING A FAST EM ALGORITHM 2. ANALYSIS OF SIMULATED DATA
The paper reports on a fast EM algorithm for genomic selection by mapping QTL in genomewide dense SNP marker data. The algorithm called emBayesB was used to analyse a 6000 SNP dataset simulated for the QTLMAS XII workshop. True breeding value was accurately predicted by GEBV with a correlation of 0.85 in the validation data, while the regression coefficient of 0.97 indicated unbiased prediction...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3170038